10-0186  318  A  MOTE  OM  EXTENDED  QUASI-LIKELIHQOD<U>  NORTH  CAROL INfl  1/1 
UN IV  AT  CHAPEL  HILL  DEPT  OF  STATISTICS 
H  DAVIDIAN  ET  AL  FEB  87  HHS-1716  AFOSR-TR-87-1132 
UNCLASSIFIED  F49620-85-C-0144  F/G  12/3  NL 


AD-A186  318 


la  AfcPORT  SfcC*-.A*7>  '‘lASSH-'CAI  ION 

UNCI.ASSII  ILL'  _ _ 


;,£C~R‘TV  CL-»SSiF.CATiGN  muTmORiTv 


2b  OEClASSiFiCATION  DOWNGRADING  SChEOL/lE 


REPORT  DOCUMENTATION  PAGE 

’  ..  ™™___  ________________ 


|4  PERFORM ING  ORGANIZATION  REPORT  NuMSERiSI 

$  MIMliO  SERIES  »  1 7 1 1> 


So  OFF.CE  3  >  M80  L 

/  f  applicable 


\W>S 


6*.  NAME  OF  PERFORMING  ORGANIZATION  |bo  OFFICE  SYMBOL 

if  appucabie  • 

Univ  "f  NC,  Chapel  Hill  _ 


6c.  ADQRESS  y,  State  ana  /.IP  Codei 

Stat.  Dept. 

UNC-CH,  Chapel  Hill,  NC  J751-1 


8a.  NAME  OF  FUNOiNG/SPONSOR-NG 
ORGANIZATION 

Al'OSR 


8c.  ACLpRESS  i  City.  State  and  /.IP  Coo*’1 

o 

Bolling  Air  force  Base 
Washington,  DC  J035J 


11  TITLE  Include  Security  Cluuificaiion  > 

"A  NOIL  ON  EXTENDED  iMIASf-l  1KI’  f! d >• )() 


12.  personal  auTmORiSi 

Davitiian,  Marie  and  I'.imll,  K.C 


13«.  type  OP  REPORT 


16.  SUPPLEMENTARY  NOTATION 


5  MONITORING  ORGANIZATION  REPORT  NUMBERISI 

AFOSR  TK-  87*  1132 


?a  NAME  OF  MONITORING  ORGANIZATION 

Air  force  Office  of  Scientific  Research 


7b  AOQRESS  <C'i(y,  .Mute  and  /IP  Code/ 

'x  V  ’  ^ 

e  v  '  n  O  O'' 


10  SOURCE  OF  FUNDING  NOS 


PROGRAM 
ElEMENT  NO 


PROJECT 

NO. 


WORK  UNIT 
NO. 


U\0AJ  i'*. 


14  DATE  OF  REPORT  ,  V'r  .  \to.,  Dayt  |  1  5.  P AGE  CO UNT 

l‘u!u*uarv  FJS? 


17  cos at i  cooes 


GROUP  '  SUB 


Id  S'-8jfcG^  'HMS  I  -n  nn^-n1  d  •u't,fi*an  jna  n«n(t/>  6y  (>loc«  numlwr) 

v;  *1  :  !■•.  :  i.  i  .'v  ed.ist  ic  regression  model, 

;  i  *  * ;  •  i  -  -  i  ;  .i  r.aaet  ers  ,  pseudo  -  I  i  he  I  i  hood 

;  -  :  i.'.i  t  i  ui  <  ,  t  i  m.<  t  i  on  . 


19.  ABSTRACT  Con  tinut  on  rtvene  <  f  necetsar-,  jna  .den  it  •  .  *>»  ••  '*• 

We  study  the  method  ,»!'  e\f  ended  ...  i  . 

This  method  is  shown  to  he  cl  >--el.  n  l.ited  • 
as  in  Carroll  li  Ruppert  illJs_'i. 


:  •  !  ■ t  ton  of  the  variance  function, 

o • Uod  ot  jeieudo -  1  i he  1  i hood  estimation 


JE% 

W  OCT  0  6  1987  |Jj 


20.  OlSTRl  BUTION/A  V  AILABI  LI  T  Y  OF  ABSTRACT 

UN  CLASS  I  FlED/UNLlMlTEC,  X  SAME  AS  RPT  _ ,  O  T  i  C  USE  RS 


22a.  NAME  OF  RESPONSIBLE  INDIVIDUAL 

fcisirfaNmks  \  ,  / 


DD  FORM  1473,  83  APR 


21  ABSTRACT  SECURITY  CLASSIFICATION 


22b  TELEPHONE  Number^  ,  ,  22c  OFFICE  SYMBOL 

Include  \n?a  Code  t 


y 1 u J3u: 


EDITION  OF  1  jAN  73  (S  OBSOLETE 


UNLESS  I  ]■  11  1) _ 

SECURITY  CLASSIFICATION  OF  THIS  PA 


'  V  N/  N 

CvtfK 

mr- 

nciAt 

yvWV-v-  -/  v  •«-  »_• 
■'■w'jLyV'Vyo-v- .  -r.  s'.  ■>;, 

AFOSK-Tk-  es  7  -  1  13  2 


THE  INSTITUTE 
OF  STATISTICS 

THE  CONSOLIDATED  UNIVERSITY 
OF  NORTH  CAROLINA 


A  NOTE  ON  EXTENDED  QUASI -LIKELIHOOD 


by 

Marie  Davidian 
and 

R.  J.  Carroll 

Mimeo  Series  #1716 

February  1987 

I jKPA KTMK.VT  OF  STATISTICS 
Chapel  Hill,  Nmth  Carolina 


A  NOTE  ON  EXTENDED  QUASI-LIKELIHOOD 


M.  Davidian 
R.J.  Carroll 

Department  of  Statistics 
University  of  North  Carolina  at  Chapel  Hill 
321  Phillips  Hall  039  A 
Chapel  Hill.  North  Carolina,  USA 
27514 


SUMMARY 


We  study  the  method  of  Extended  Quasi-Likel ihood  estimation  of  a 
variance  function.  This  method  is  shown  to  be  closely  related  to  the 
method  of  pseudo- 1 ike 1 ihood  estimation  as  in  Carroll  &  Ruppert  (1982). 
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1 .  INTRODUCTIOH 


Consider  the  following  mean-variance  model  for  observable  data  y- 

E(yi )  =  =  Vi(P)  =  f  (^i  ./3)  ;  var(y.)  =  a2g2(ni  ,  z .  ,  0)  .  (1.1) 

Here,  is  the  ith  response  variable  of  N  independent  observations, 

(x^.z^)  are  associated  vectors  of  covariates,  f  is  the  regression  function, 
p  is  a  p-vector  of  regression  parameters,  a  is  a  scale  parameter,  and  g  is 
the  variance  function  with  variance  parameter  0  (r  x  1).  For  example,  the 
variance  may  be  modeled  as  proportional  to  an  unknown  power  of  the  mean: 

gfpij.Zj.e)  =  pi®,  >  o.  (i.2) 

Special  cases  of  (1.1)  are  used  in  applications  such  as  radioimmunoassay, 

econometrics,  and  chemical  kinetics.  Model  (1.1)  includes  the  class  of 

generalized  linear  models,  see  McCullagh  &  Nelder  (1983). 

A  usual  aim  is  the  estimation  of  P,  with  estimation  of  the  variance 

function  parameters  as  an  adjunct.  However,  as  discussed  by  Davidian  & 

Carroll  (1987)  and  Carroll,  Davidian  &  Smith  (unpublished),  estimation  of 

the  variance  function,  in  particular  the  parameter  0,  is  an  important 

problem  both  for  estimation  of  P  and  in  its  own  right. 

Most  methods  for  estimating  0  are  "regression"  methods  based  on 

generalized  least  squares.  In  these  techniques,  0  and  a  are  estimated  by  a 

weighted  regression  of  some  function  of  the  absolute  residuals  from  a  fit 

PM  on  their  expectations.  For  example,  in  location-scale  problems  squared 

2 

residuals  have  approximate  mean  proportional  to  g  (pi^,z^,0)  and  variance 

4 

proportional  to  g  (pi.  ,z.,0).  Thus  an  estimate  of  0  can  be  obtained  by  a 


-  2  - 


2  2  A 

generalized  least  squares  regression  of  squared  residuals  on  a  g  (pj.Zj.B) 

^  A  A 

with  variance  function  g  (p^.z^.0),  where  =  f(x^,0M).  A  related  method 
is  the  pseudo- likelihood  approach  of  Carroll  &  Ruppert  (1982).  In  this 

A 

method,  one  pretends  /3  =  (3*  and  then  estimates  (a, 6)  by  normal  theory 

A 

maximum  likelihood,  maximizing  6 ,cr)  ,  where 

N 

£pL(P.e.CT)  =  -N  log  a  -  2  log  [g{|Ji(P),zi,e}] 

N 

-  (2a2)'1  I  (y  -  f(x  P))2/g2{p  (/3).z..0)  .  (1.3) 

i=l  11 

This  process  may  be  iterated  with  a  generalized  least  squares  routine  for 
P-  The  pseudo- 1 ike 1 ihood  method  is  asymptotically  equivalent  to  weighted 
regression  on  squared  residuals,  and  full  iteration  of  such  a  regression 
yields  the  pseudo-likelihood  estimate.  Both  methods  can  be  modified  to 
account  for  loss  of  degrees  of  freedom  for  preliminary  estimation  of  P  as 
in  Harville  (1977);  for  a  discussion  and  a  review  of  many  common  methods 
for  estimation  of  0,  see  Davidian  &  Carroll  (1987). 

Pseudo- 1 ike  1 ihood  and  weighted  squared  residual  estimation  are  based 
upon  the  method  of  moments.  Welder  &  Pregibon  (1987)  instead  attempt  to 
define  a  family  of  distributions  with  mean  and  variance  functions  given  by 
(1.1),  this  class  including  as  special  cases  skewed  distributions  such  as 
the  Poisson  or  gamma.  Their  extended  quasi- likelihood  is 

^(P.e.cr) 
where 


=  (-1/2)  2  [log  {2iro2g2(y  ,z  ,0)}  +  D(y  ,p  (0),z  }/a2], 

i=l  ill 

(1.4) 

V 

D(y,p,0)  =  -2  F  — - dw. 

J  8 (w.z.0) 


The  function  £  is  sometimes  but  not  always  an  exact  log-1 ikel ihood. 
Under  (1.2),  if  0=0,  normal  log-likelihood;  for  0  =  1.5,  that 

of  the  inverse  Gaussian.  For  0  =  .5,  a  =  1,  differs  from  the  Poisson 

log-likelihood  by  replacing  y^ !  by  its  Stirling  approximation;  for  0=1, 
differs  from  the  gamma  log-1 ikel ihood  by  a  factor  depending  on  a.  One 
motivation  for  (1.4)  is  the  Edgeworth  expansion  of  Barndorf f-Nielsen  and 
Cox  (1979)  or  the  related  saddlepoint  approximation  of  Daniels  (1954), 
which  yield  an  expansion  for  the  density  of  the  mean  of  m  random  variables 
from  a  one  parameter  exponential  family  as  m  The  leading  term  of  the 

expansion  at  m  =  1  is  the  extended  quasi- likelihood  summand.  See  Efron 
(1986)  for  a  related  formulation.  Note  that  the  form  of  may  be 

unsatisfactory  in  situations  for  which  g(y,z,0)  =  0  for  y  =  0.  In  this  case 
Nelder  &  Pregibon  suggest  replacing  g(y,z,0)  by  g(y+c,z,0)  for  some  c;  we 
use  this  adjustment  where  applicable  in  our  discussion. 

An  additional  reason  for  considering  approximate  likelihoods  for  a 
mean-variance  model  is  that  linear  exponential  families  with  given 
mean-variance  relationship  do  not  always  exist.  For  example,  Bar-Lev  &. 
Enis  (1986)  have  shown  that  if  the  distribution  of  yt  is  an  exponential 
family  with  variance  function  (1.2),  it  is  necessary  that  0  €  (-<“,0)  U 
(0,1/2),  so  that  such  a  family  exists  only  when  0  €  (0)  U  [1/2, °°),  and  the 
general  form  for  the  density  parameterized  in  terms  of  0  and  a  is  unwieldy. 

We  have  observed  in  many  examples  that  the  pseudo- 1  ikel  ihood  and 
quasi-likelihood  methods  lead  to  similar  estimates,  although  sometimes 
inferences  for  0  are  substantially  different.  In  Section  2,  we  construct 
an  asymptotic  theory  for  extended  quasi-likelihood  which  allows  an  easy 
illustration  of  the  relationship  between  the  two  methods  and  suggests  a 
simple  motivation  for  the  form  of  extended  quasi-likelihood.  We  show  that 


.'-.vy  v  .v.  •  ; 

*  <”***  **«.  ■•f  r  »  *"  Jt  v**  **«»  f «' • 


V-  -S<*'  -tMm  •„*  «^*  «.,•  ^  . 


under  certain  conditions  the  two  estimators  are  asymptotically  equivalent, 
although  extended  quasi-likelihood  can  be  affected  by  an  asymptotic  bias 
while  pseudo- 1  ike 1 ihood  is  not  when  the  underlying  distribution  is 
asymmetric.  In  Section  3  we  discuss  inference  for  0  based  on  the  two 
approaches.  From  the  theory  of  Section  2  we  observe  that  while  inference 
based  on  asymptotic  theory  for  the  two  approaches  may  yield  similar  results 
under  some  conditions,  such  a  test  based  on  extended  quasi-likelihood  can 
be  adversely  affected  by  possible  asymptotic  bias  of  the  estimator,  a 
problem  not  shared  by  pseudo- 1  ike 1 ihood  tests. 


2.  SOME  ASYMPTOTIC  RESULTS 

Neither  pseudo- 1 ike 1 ihood  nor  extended  quasi-likelihood  are  exact 
likelihood  approaches.  Pseudo-likel ihood  is  based  on  the  method  of 
moments,  so  that  the  estimating  equations  are  unbiased  and  hence 
consistency  and  asymptotic  normality  obtain  under  very  general  conditions 
even  without  the  assumption  of  normality.  Let  ufp^.z^.O)  =  log  gfp^.Zj.O), 
UgCp^z^.B)  be  its  column  vector  of  partial  derivatives  with  respect  to  0, 
W0^4i'Zi'6^  =  u0^i,zi’0)  ~  N  *2  u0(pj,Zj,0),  and  f(p,0)  be  the  limiting 
covariance  matrix  of  the  u0.  Let  subscripts  denote  differentiation  with 
respect  to  the  argument,  e.g.,  g^p^.z^.O)  =  dgfp^ ,  z^ ,  0)/dp^ .  Define  the 
errors  =  (y^-p^J/fa  g(p^,zlt0)},  and  assume  the  {e^  are  independent 
with  skewness  and  kurtosis  x^;  x^  =0  for  normality.  Let  i  =  (T).0t)t 
and  use  subscripts  PL  and  QL  to  denote  pseudo-likelihood  and  extended 


quasi-likelihood,  respectively. 


S8 


1 


CM 


RESULT  1  (Davidian  and  Carroll,  1987).  Suppose  that  (PH-p)/a  =  0p(N  ) 
and  fp^  -  -r  =  0p(N  )•  Then  0p^  is  asymptotically  normally  distributed 
with  mean  8.  If  a  -»  0  simultaneously  with  N  -»  00 ,  then 


1/2  ~ 

N  (Sp,  -  6) 


-1  -1/2  2 
{2f (p. 0) }  N  1A4  2  (ef 

i=l 


-  l)w0(pi,zi,0)  +  o(l). 


(2.1) 


If  the  {e^}  are  identically  distributed  with  kurtosis  k,  the  covariance 

A 

matrix  of  the  asymptotic  distribution  of  0p^  is  given  by 


(2  +  k)  (4N  f(p.0)}  \ 


(2.2) 


The  assumption  a  -»  0  is  a  useful  simplification  technically  and  is 

relevant  in  applications  where  a  is  '’small"  relative  to  the  means  as  in 

radioimmunoassay,  see  Carroll,  Davidian  &  Smith  (unpublished).  In  the 

gamma  and  lognormal  distributions,  a  is  the  coefficient  of  variation,  which 

is  often  fairly  small.  Alternatively,  think  of  yi  as  the  mean  of  m 

2  -1/2 

observations  with  mean  p^  and  variance  g  (p^.z^,0),  equate  o  and  m  ,  and 
let  m  -» 

The  assumption  a  -*  0  yields  a  motivation  for  (1.4).  Since  the  goal  of 
extended  quasi-likelihood  is  to  describe  a  class  of  distributions  "nearly” 
containing  exponential  families,  consider  a  density  h  such  that 


log  h(y ,a, 0,ct)  =  (ya  -  b(a)}/o  +  c(y,0  a) 


(2.3) 


for  some  b,  c,  and  a  =  a(p,0).  To  satisfy  (1.1)  we  require  db(a)/da  -  p 


/4  /  A  * 


-  6  - 


and  5^b(a)/3a^  = 
2 

g  (p,z,0)  =  du/da. 


g2(p,z,0),  implying  that  p  =  (3b(a)/3p)  d\i/da  and 

This  yields,  writing  b  now  as  a  function  of  p. 


M  9  M  o 

a  =  /  (l/gZ(u,z,0)}  du;  b(p)  =  /  {u/g  (u.z.6)}  du. 

— <X>  —00 


Plugging  into  (2.3)  gives  after  simplification 

log  h(y.a.a)  =  -  (2a2)  1  D(y,z,0)  +  d(y,0,a)  (2.4) 

for  some  function  d.  For  h  to  be  a  density  we  must  choose  d  so  that  h 
integrates  to  one;  by  approximating  the  first  term  on  the  right  side  of 
(2.4)  when  a  is  small  we  may  approximate  d.  Since  when  a  is  small  we  have 
a"2  D(y.p.z)  35  (y-u)2/{o2g2(y .  z ,  9) )  .  d  S  -(1/2)  log{2Tra2g2(y .  z .  0) . 

Inserting  this  in  (2.4)  yields  the  summand  of  (1.4). 

The  fact  that  (1.4)  is  an  approximate  log- 1  ikel  ihood  implies  that  0^ 
need  not  be  consistent.  With  the  suggested  adjustment  c  =  1/6  as  in  Nelder 
&  Pregibon  (1987),  if  the  { y ^ )  are  dist.  ted  as  Poisson  with  means  {p^} 
talcing  on  values  1  and  4  in  equal  proportions,  the  theory  of  M-estimation 
as  in  Huber  (1981  p.  130-132)  implies  that  0^  converges  to  0.675;  if  the 
{p^}  take  values  1  and  5,  0^  converges  to  0  640  If  the  (p.)  take  on 
larger  values,  such  as  30  ,  40  and  50  in  equal  proportions,  however,  0^^ 
converges  to  0.500. 

Since  the  estimating  equation  for  the  extended  quasi-likelihood 

A  A 

estimate  8^  can  be  biased,  standard  asymptotic  theory  for  0q^.  while 
possible  to  construct,  is  not  fully  informative.  As  an  approximation  we 
use  the  small  a  assumption  to  construct  an  asymptotic  theory.  We  also 
describe  an  approach  suggested  by  the  Poisson  case  for  "large"  {p^}. 


1/0  ~  1  /O  ~ 

RESULT  2.  Suppose  that  N  (-r^  -  nr)  =  Op(l)  and  N  (0^-  j3)/o  =  Op(l) 
1 /o 

if  N  a  ->  X  >  0  as  N  -►  ®,  o  -►  0.  Then 


N1/2(0ql  -  e)  =  {2  f(M.e)}'1  N  1/2  2  (e2  -  1)  UgOij.Zj.0) 

+  (N1/2ct)  {6  ffci.O)}'1  +  o(l),  where 


(2.5) 


=  N  2  C.{g(pi.zi>0)  u  (4..z1(0)  -  2  g  (M..ZJ.0)  .  z .  .  0)  }  . 


A  sketch  of  the  proof  is  contained  in  the  Appendix.  The  implication  of 

A  A 

(2.5)  is  that  while  0p^  and  0q^  behave  similarly,  they  differ  in  an 
asymptotic  fashion  through  the  second  term  on  the  right  hand  side  of  (2.5) 
in  a  way  that  might  affect  asymptotic  inference.  For  example,  if  the  (e  } 

A 

are  identically  distributed  with  skewness  f  and  kurtosis  x,  then  0q^  is 
asymptotically  normal  with  covariance  (2.2)  and  mean 


1/9  -1 

0  +  (6Ii  f (fi,0) }  (AfC).  where  -*•  C. 


From  (2.5),  0^  and  0p^  will  be  asymptotically  equivalent  only  if  A 

-*  0;  the  latter  will  occur  for  symmetrically  distributed  data. 

In  the  case  of  (1.2),  uq(ji  ,z .,0)  =  log  n.  so  that 

U  1  1  1 

.1  N  0,  _  N 

Cn  =  N  ^  Cj  {1  -  20  (log  ni  -  1N)}.  1N  =  2^  log  |ij. 


For  the  normal  distribution,  f  =  x^  =  0;  for  the  gamma,  lognormal,  and 

2 

inverse  Gaussian  distributions  =  0(o)  and  x^  =  0 (a  ),  so  that  the 
asymptotic  bias  is  0  and  the  two  estimators  are  asymptotically  equivalent 
with  covariance  the  same  as  if  the  data  were  normally  distributed  with  mean 


p^  and  variance  a  p^  .  From  Bar-Lev  &  Enis  (1986),  =  0(a)  for 

distributions  which  are  exponential  families  with  0  €  {0}  U  [1/2,00).  If 

the  (y . }  are  not  from  an  exponential  family,  the  asymptotic  bias  need  not 

be  zero.  For  example,  consider  a  shifted  gamma  model  y.  =  p^  + 

ag(p^ , z^ , GJfej ,  where  w^  has  a  gamma  (ac^  •  ^^ )  distribution  with  E(w^)  = 

2-1/2  2 

a./#.  ,  and  e ^  =  (w^  -  (a^/^)}  (a./<>.)  .  so  that  =  1.  In  this 

-1/2 

case  (.  =  2  a.  ,  so  that  if  the  { a do  not  depend  on  a.  the  asymptotic 
bias  will  not  vanish. 

An  asymptotic  theory  for  which  the  means  are  "large"  in  which  a 

remains  fixed  yields  a  similar  result  under  (1.2)  if  9  <  1.  Let  p^  ^  be  a 

sequence  to  be  chosen  shortly.  Define  pj  =  p^/p^  ^  a™!  Yj  =  yVPQ  N  S° 

that  ei  =  (y^  -  p^/ffi  ),  where  6  =  a  p^  ^  ’ .  If  as  N  -♦  ®,  min^  pi  -*  00 

and  Pq  ^  -*  00  in  such  a  way  that  the  (p^  and  the  {y^}  are  well-behaved, 

then  if  0<  1,  6  -*  0  as  N  -»  ®  so  that  the  calculations  here  parallel  those 

for  the  case  of  small  a.  By  analogy,  the  small  a  part  of  Result  1  holds. 

Replacing  a  by  5  in  (2.5),  in  the  Poisson  case  for  which  0  =  .5  and  <7=1, 
-1/2-1 

fj  =  Pj  and  =  pi  ,  so  that  -»  0  and  the  limiting  covariance  of  Oq^ 
is  as  if  Kj  =  0.  Thus,  in  the  case  of  "large”  means  and  data  distributed 
as  Poisson,  extended  quasi-likelihood  and  pseudo- 1 ike 1 ihood  will  behave 
simi larly . 


3.  INFERENCE  FOR  8 

The  asymptotic  distribution  theory  of  Section  2  can  be  used  to 
construct  tests  of  HQ:  0  =  0Q.  For  simplicity  consider  identically 

A  ^  A 

distributed  (e^  with  kurtosis  k.  From  (1.3),  0p^  maximizes  fp^O^.0), 


where 


-  9  - 


«pL(/3,0)  =  -N  log  opL(0,e)  -  2  log  g{M-.(J3)  .z^G}  , 

N 

apL(P.0)  =  2  {(Yi  -  f(x.,/3)}2/g2{p.(P).z.,0}. 

i=l 


One  might  reasonably  base  inference  for  0  on  a  test  statistic 


tn  =  -2  [<pLiP(s0).e0>  -  ?PL{(J(8PL).0PL}], 


where  /3(0)  denotes  a  generalized  least  squares  estimate  computed  at  0. 

Result  1  may  be  used  to  show  that  under  Hq,  {2/(2+*c)}  is  asymptotically 
2 

distributed  as  v  ,  so  that  a  test  based  on  this  statistic  with  k 
r 

appropriately  estimated  is  an  asymptotic  a-level  test.  McCullagh  & 
Pregibon  (1987)  consider  estimators  for  the  cumulants  for  linear  regression 
mode 1 s . 

Nelder  &  Pregibon  (1987)  suggest  a  likelihood  ratio  type  test  based  on 
treating  the  extended  quasi-likelihood  as  an  actual  likelihood.  Such  a 
test  is  based  on 


QN  “  2  ^eQL^0o)’0O^  *QL^0QlJ’0QL^’ 

where 

*  N 

^(^.9)  =  -N  log  CTqj^P.0)  -  2  log  g(y.,zi,0), 

2  -1  N 

<7^03.0)  =  N  1  2  D{y1.4i(P).z.}. 


In  the  situation  of  Result  2,  under  Hq  (2/(2+x)}  is  asymptotically 

2 

distributed  as  noncentral  \  with  noncentrality  parameter  A  = 

r 

X2f2Ctf (p,0)_1c  {9(2+x)}_1.  As  long  as  A  =  0,  comparing  this  statistic  to 


rV V 


■-  k\.  /.  *v  -  ’  - 

■  v£* of. 


the  percentiles  of  the  xr  distribution  is  an  asymptotic  a-level  test  which 

is  asymptotically  equivalent  to  the  test  based  on  T^.  Nelder  &  Pregibon 

2 

suggest  comparing  directly  to  the  percentiles  of  the  xr  distribution, 
not  accounting  for  the  factor  2/(2+x).  In  the  saddlepoint  approximation 
approach,  m  -»  00  implies  k.  -»  0,  thus  they  observe  that  if  the  underlying 
distribution  of  the  data  is  known  to  be  from  an  exponential  family,  then 
such  a  test  is  asymptotically  valid.  In  our  asymptotics,  for  the  cases  of 
the  normal,  Poisson,  gamma,  and  inverse  Gaussian  examples  cited  in  Section 
2  we  see  this  to  be  the  case.  We  further  obtain  the  correct  form  and 
properties  for  a  test  of  this  type  when  only  the  mean-variance  relationship 
is  specified. 

For  a  model  such  as  (1.1)  for  which  only  the  mean  and  variance  are 
specified,  interest  in  8  may  be  in  the  context  of  trying  to  understand  the 
structure  of  the  variances,  not  the  form  of  the  underlying  distribution.  A 
test  based  on  will  approach  its  nominal  level  only  if  A  =  0.  Thus,  when 

A 

the  underlying  distribution  of  the  data  is  such  that  8q^  is  biased 

asymptotically  so  that  A  ^  0,  in  the  case  of  a  nonsymmetric  error 

2 

distribution  for  example,  the  validity  of  a  \  test  based  on  may  be 
seriously  affected  in  that  there  will  be  bias  in  the  asymptotic  levels  of 
the  test. 
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APPENDIX  :  SKETCH  OF  PROOF  OF  RESULT  2 


For  convenience,  let  77  =  log  a  and  let  (/3,0,r))  be  the  joint  extended 


quasi- likelihood  estimators  for  (P.O.tj).  Let  r(i,(3,6)  =  {1  ,  1^(7^  ,  z^ ,  0) } * 


P(i.P.e)  =  fj3(x../?)/g{/iii(/3).zi.0}.  ^  =  -4N  *2  r(i,p,e )  T(i,p,Q)1.  JN  = 


2N  *2  e.  p(i  ,/3,0)  T(i.p,0)t  and  =  N_12  p(i./3.0  p(  i,p.e)Z.  From  (1.4). 


(P  ,0,77)  solves 


02,N(P.e.n) 


where 


(A.l) 


Q1n(P.0.t])  =  N  1/2  ^  e  211  (Y.  -  p.)  p(i,0,0)/g(p. .z.,0); 


02  N(^.9.T7)  =  N  1/2  ^  {  e  2t?  D(Yi,fx..z.)  -  1}: 


Q3  N(p.  0.77)  =  N  1/2  2  [  -  |  e  217  3D(Y  ,n  ,z  )/30  -  (dlog  g(Y  z  .0)/30}]. 

i=l  '  1 


The  following  result  is  shown  by  assuming  appropriate  smoothness  conditions 


for  g  so  that  D  may  be  differentiated. 


LEMMA  A.  Under  regularity  conditions. 


..  N  N 

-1  9  -1  *  9  -1  T  4 

N  2  D(Y  p  z  )  =  ct  N  2  ef  +  a  N  2  s  eT  +  0  (a  ) ; 

i=l  1=1  1  i=l  ’  P 


V* \  "*■*..'** V  *»”./■  ‘ 


w\  \  -‘./s, 
^ V- s*  . 


"j*  *  \»  -  •  ■ 1 *  - «  ■„*  ■ 


12  - 


-1  N  2  -1  N  2 
N  2  9D(Y . ,p . , z .  )/dd  =  -2  {  o  N  2  (1.0.0) 

1=1  111  i=1  i 

N 

+  N"1  2  s  a3  }  +  0  (a4); 

1=1  1  P 


lS  IS 

N-1  2  d^D{\ .  ,fi .  ,x  ,)/dQdQ^  =  2  a2  N_1  2  e2  [3  u  (1  ,0.0)  uj(i  .0.0) 

1=1  1  1  1  1=1  1 

-^g00^i,zi,e^  7  S(»W0)}  ]  +  °p^3)1 

where  g0e(ni,z..0)  =  32{g(ji.  ,  z.  ,  0)  >/a090 11 ,  s^.  =  -2  g^(4.  .  z.  .  0)/3,  and 

s2, 1  =  g0u^ilZi’0)/3  ■  VW9*  gp^i'zi*9>- 


A  Taylor  series  in  (A.l)  using  consistency,  a  Taylor  series  in  ct  about 
0  using  Lemma  A  and  laws  of  large  numbers  yield  after  simplification 


1/2  (P  '  P)/o 


r?  ~  V 


0-0 


(A. 2) 


-1/2 

N  2  e  p(l ,p,G) 
1  =  1 


-1/9  1  2 

N  Z  2  {e  -  1)  t( i ,0, 0) 
i=l 


+  (N1/2a) 


M-i  :  3 

N  2  e .  s . 

1=1  1  1 


+  V1}' 


where  =  (s^  s^  i).  Equation  (A. 2)  implies  that,  as  N  -*  °°,  a  -*  0, 


Equation  (A. 2)  also  shows  that  in  these  asymptotics  0 is  equivalent  to  a 


generalized  least  squares  estimator  for  0.  □ 
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